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1 Introduction. 



To form the classical empirical process one starts with iid random variables, 
{Xj : j > 1}, with distribution function, F, and with 

1 n 

^(A) = -J2^eA (1) 

one considers the process ¥ n (y) = P„((— oo,y]). 
By the classical Glivenko-Cantelli Theorem 

sup \¥ n ((-oo,y)) - F(y)\ — > a.s. 
By Donsker's theorem 

{^(¥ n (y)-F(y)):yER} 

converges in distribution in a sense described more completely below. Hence 
limit theorems for such processes, such as the Law of Large numbers and 
the Central Limit theorem (CLT), allow one to asymptotically get uniform 
estimates for the unknown cdf, F(y) = Pr(X < y) via the sample data. 

A more general version of these processes is to replace the indicators of 
half-lines in ([[]) by functions of a "random variable" taking values in some 
abstract space (S, S). More specifically, 

1 n 

{^E(/ra- E /( x )) ( 2 ) 

where the index set, F, is a subset of Coq(S,S) or an appropriate subset of 
C 2 (S,S,P). 

However, even in the case that the class of functions is a class of indicators, 
unlike the classical case, it is easy to see there are many classes of sets, C, 
for which the limit theorem does not hold. As a matter of fact the limiting 
Gaussian may not be continuous, for example, if C = all Borel sets of K or 
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even C = all finite sets of R. And further, even if the limiting Gaussian 
process is continuous, the limit theorem may still fail. 

Luckily, in the case of sets, modulo questions of measurability, there are 
necessary and sufficient conditions for this sequence to converge in distri- 
bution to some mean zero Gaussian process. However, all the nasc's are 
described in terms of the asymptotic behavior of a complicated function of 
the sample, {X n }™ =l . What we attempt to do in this paper is to obtain 
additional sufficient conditions that are useful when X takes values in some 
function space S, and the sets in C involve the time evolution of the stochas- 
tic process X. Of course, C is still a class of sets, but a primary goal that 
emerges here is to provide sufficient conditions for a uniform CLT in terms 
of the process X = {X(t) : t G E} that depend as little as possible on 
the parameter set E. However, classes of sets such as this rarely satisfy the 
Vapnik-Cervonenkis condition. Also, this class of examples arises naturally 
from the study of the median process for independent Brownian Motions (see 
[Swa07] . |Swa08] ) . where he studies the limiting quantile process for indepen- 
dent Brownian motions. This was observed by Tom Kurtz, and the followup 
questions led us to start this study. Here we concentrate on empirical process 
CLT's, and our main result is Theorem 3 below. Another theorem and some 
examples showing the applicability of Theorem 3 appear in section 7. In sec- 
tion 8 there are additional examples which show some obvious conjectures 
one might be tempted to make regarding the CLT formulated here are false. 
In particular, the examples in section 8.4 motivate the various assumptions 
we employ in Theorem 3. As for future work, an upgraded version of [Ver72] 
would perhaps allow one to relate the results obtained here and those of 
Swanson, but this is something to be done elsewhere. 
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2 Previous Results and Some Definitions. 

Let (S,S,P) be a probability space and define (Jl, E, Pr) to be the infinite 
product probability space (S N ,S N , P N ). If Xj : Q — > S are the natural 
projections of Q into the j th copy of S, and J 7 is a subset of C 2 (S, S, P) with 



Let £oo(J~) be the bounded real valued functions on J 7 , with the sup- norm, 
and recall that a Radon measure /i on ^(J 7 ) is a finite Borel measure 
which is inner regular from below by compact sets. Then the functions 
/ ->■ f(Xj) - E(f(Xj)),j > 1, are in MJ 7 ), and we say 7 G CLT(P) if the 
stochastic processes {v n (f),f £ J r }, n > 1, converge weakly to a centered 
Radon Gaussian measure 7p on £ o(^ 7 )- More precisely, we have the following 
definition. 

Definition 1. Let J= C C 2 {P) and satisfy (3). Then J= e CLT(P), or J= is 
a P— Donsker class, if there exists a centered Radon Gaussian measure 7p 
on £ o(^ r ) such that for every bounded continuous real valued function H on 
PociJ 7 ) we have 



where E*H is the usual upper integral of H. If C is a collection of subsets 
from iS, then we say C G CLT(P) if the corresponding indicator functions 
are a P-Donsker class. 

The probability measure 7p of Definition 1 is obviously the law of the 
centered Gaussian process Gp indexed by J 7 having covariance function 



sup \f(s)\ < oo, s G S, 



(3) 



then we define 






EGp(f)G P (g) = Epfg - E P fE P g, f,geF, 
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and L 2 distance 

p P {f, g) = E P ({(/ -g)- E P (f - g)} 2 )Kf, 9 e J". 

Moreover, if 7p is as in Definition 1, then it is known that the process Gp 
admits a version all of whose trajectories are bounded and uniformly pp 
continuous on J 7 . Hence we also make the following definition. 

Definition 2. A class of functions T C /^(P) is said to be P-pregaussian 
if the mean zero Gaussian process {Gp(f) : / G J 7 } with covariance and 
L 2 distance as indicated above has a version with all the sample functions 
bounded and uniformly continuous on J 7 with respect to the L 2 distance 
Pp(f,9)- 

Now we state some results which are useful for what we prove in this 
paper. The first is important as it helps us establish counterexamples to nat- 
ural conjectures one might make in connection to our main result appearing 
in Theorem 3 below. 

Theorem 1. (Gine and Zinn [GZ84] for sufficiency and Talagrand |Tal88] for 
necessity of the A c condition in (ii)) Let A (A) denote the number of distinct 
subsets of A obtained when one intersects all sets in C with A. Then, modulo 
measurability assumptions, conditions, (i) and (ii) below are equivalent. 

(i) The central limit theorem holds for the process 

{^Y t [ I ^-Pr(XeC)]:CeC} 

or more briefly C G CLT(P). 

(ii) 

(a) — ^= — - — — in (outer) probability, and (5) 

\/n 



(6) C is P — pregaussian. (6) 
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A sufficient condition for the empirical CLT, which is used in the proof 
of our main theorem, is given in Theorem 4.4 of |AGO Z88j. 

Theorem 2. |AGOZ88j Let F C C 2 (S,S,P) and let F = sup /£F \f(X)\. 
Assume that ||P/||jr < oo and 

(i) u 2 Pr*(P > u) ->■ as u -> oo, 

(ii) F is P-pregaussian, and 

(iii) there exists a centered Gaussian process {G(f) : / G F} with L 2 dis- 
tance do such that G is sample bounded and uniformly do continuous 
on F, and for some K > 0, all / G J 7 , and all e > 0, 

1 /2 

[supw 2 Pr*( sup |/ - g\ > u)] < Ke. 

u >o {g:d a (gj)<e} 

Then F G CLT(P). 

In this paper we take iid copies {Xj}'^ =1 of a process {A(t) : t G P}, and 
consider 

1 n 

{-^Efew^ " Pr ( X W '• t & E,y € R}, (7) 

with the goal of determining when these processes converge in distribution 
in some uniform sense to a mean zero Gaussian process. For example, if the 
process X has continuous sample paths on P, then S = C(E) and the class 
of sets C in (i) of Theorem [1] consists of the sets {/ G C(E) : fit) < y} 
for t G P and y G R, and we examine when C G CLT(P). As a result of 
the previous definitions, the limiting centered Gaussian process has a version 
with sample paths in a separable subspace of £^(0), and as a consequence 
of the addendum to Theorem 1.5.7 of |vdVW 96j . p. 37, almost all sample 
paths are uniformly L 2 continuous on C provided we identify the indicator 
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functions of sets in C with C itself. Furthermore, we also have that C is 
totally bounded in the L 2 distance with this identification. In addition, the 
following remark is important in this setting. 

Remark 1. The assumption that a centered process {X(t) : t G T} with L 2 
distance d is sample bounded and uniformly continuous on (T, d) is easily seen 
to follow if (T, d) is totally bounded and the process is uniformly continuous 
on (T, d). Moreover, if the process is Gaussian, the converse also holds using 
Sudakov's result as presented in Corollary 3.19 of |LT91j . 

To state and prove our result we will make use of a distributional trans- 
form that appears a number of places in the literature, see |Fer 67j . The paper 
|Riis09j provides an excellent introduction to its history, and some uses. In 
particular, it is used there to obtain an elegant proof of Sklar's theorem, see 
[Ski 73], and also in some related applications. 

Given the distribution function F of a real valued random variable, Y, 
let V be a uniform random variable independent of Y. In this paper we use 
the distributional transform of F defined as 



F(x, V) = F{x~) + V(F(x) - F(x-)), 



and Proposition 1 in |Riis09] shows that 



F(Y, V) is uniform on [0, 1] . 



(8) 



Riischendorf calls F(Y, V) the distributional transform of Y, and we also 
note that F(x, V) is non-decreasing in x. 



3 The Main Conditions 



Let {X(t) : t G E} be a stochastic process as in (JTj), and assume 



p(s,t) = {E{H t -H s ) 2 )*,s 



t G E, 
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where {H(t) : t G E} is some Gaussian process which is sample bounded 
and p uniformly continuous on E. In our main result, see Theorem 3 below, 
we hypothesize the relationship between {X(t) : t G E} and p(s,t),s,t G E 
given in ffTUl) . The importance of this condition in the proof of our theorem 
is 2-fold. First, it allows one to establish the limiting Gaussian process for 
our CLT actually exists. This verifies condition (ii) in Theorem 2 above, and 
is accomplished via a subtle application of the necessary and sufficient condi- 
tions for the existence and sample function regularity of a Gaussian process 
given in ( |Tal05j ) . Secondly, it also allows us to verify that the remaining 
non-trivial condition sufficient for our CLT, namely condition (iii) of Theo- 
rem 2, applies in this setting. This is useful in applications as condition (iii) 
is in terms of a single random element involved in our CLT, and hence is far 
easier to veri fy than the typical condition which depends on the full sample 
of the random elements as in [5j The reader should also note that Theorem 
2 is a refinement of a number of previous results sufficient for the CLT, and 
covers a number of important earlier empirical process papers. 

The existence of such a p(-, ■) is obtained for a number of specific processes 
{X(t) : t G E} in section 7. Nevertheless, given the process X, determining 
whether a suitable p satisfying ffTUl) exists, or does not exist, may be quite 
difficult. However, using (fi~2l) ) we may limit our choice for p in (JTU1) to be 
such that 

p(s,t) > a' 1 snp(E(\I Xt < x - I Xs < x \ 2 ))^ 

for all s,t G E and some constant c G (0, oo). 

Throughout, p(s, t), s,t G E, denotes the L 2 metric of a Gaussian process 
indexed by E and, to simplify notation, we let 

F t {x) = (F t )(x,V),xeR 

be the distributional transform of F t , the distribution function of X t . Note 
that this simplification of notation also includes using X t for X(t) when 
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the extra parenthesis make the latter clumsy or unnecessary. Moreover, this 
variable notation is also employed for other stochastic processes in the paper. 
In addition, for each e > 0, let 

sup Pr*(\F t (X s ) - F t (X t )\ > e 2 ) < Le 2 , (9) 

{s,tGE:p(s,t)<t} 

(the weak L condition) 

and 

supPr*( sup \F t (X s ) - F t (X t )\ > e 2 ) < Le 2 , (10) 

teE {s:p(s,t)<t} 

(the L condition) 

Remark 2. In the L conditions the probabilities involve an e 2 . However, since 
for any constant, C G (0, oo), an L 2 metric, p, is pre-Gaussian iff Cp is pre- 
Gaussian, WLOG we can change to Ce 2 . Moreover, note that any constant 
L sufficient for (flOl) will also suffice for ([9]), and hence to simplify notation 
we do not distinguish between them. 

Lemma 1. Let L be as in (Q, and take s,t £ E. Then, for all x £ R 

Pr(X s < x < X t ) < (L + l)p 2 (M). (11) 

and by symmetry 

Wx t < x - Ix s < x \ = Pr(X 4 < x < X a ) + Pr(X s < x < X t ) (12) 

<2(L + l)p 2 ( S ,t). 

Further, we have 

swp\F t (x) - F 8 (x)\ <2(L + l)p 2 (s,t). (13) 

X 
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Remark 3. As in Lemma 1, Lemmas 2 and 3 below use only the weak L 
condition OH]). Actually for Lemmas 2 and 3 all we need is Lemma 1. However 
in Lemma 4 we need the stronger form as stated in fllOp . 

Proof. Since F t is non-decreasing and x < y implies F t (x) < F t (y), we have 
P(X S <x<X t )< P(F t (X s ) < F t (x),F t (x) < F t (X t )). 

Thus 

P(X S <x<X t )< P(F t (x) < F t (X t ) < F t (x)+p 2 (s,t) } F t (X s ) < F t (x))+ 
P(F t (X t ) > F t (x)+p 2 (s,t),F t (X s ) < F t (x)), 

and hence 

P(X S <x<X t )< P(F t (x) < F t (X t ) < F t (x)+p 2 ( Sj t)) 

+ P(\F t (X t )-F t (X s )\>p 2 (s,t)). 
Now implies for all s, t e E that 

P(\F t (X t ) - F t (X s )\ > p 2 (s,t)) < Lp 2 (s,t), 

since its failure for so,t 6 E and e = p(so,t ) in ([9]) implies a contradiction. 
Therefore, since F t (X t ) is uniform on [0, 1], we have 

P(X S <x<X t )< p 2 (s, t) + Lp 2 (s, t). 

The last conclusion follows by moving the absolute values outside the expec- 
tation. □ 

4 The Main Result. 

Recall the relationship of the X process and p as described at the beginning 
of section 3. Then we have: 
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Theorem 3. Let p be given by p 2 (s, t) = K(H(s) — H(t)) 2 , for some centered 
Gaussian process H that is sample bounded and uniformly continuous on 
(E,p) with probability one. Furthermore, assume that for some L < oo, and 
all e > 0, the L condition (ITU1) holds, and D(E) is a collection of real valued 
functions on E such that P(X(-) G D(E)) = 1. If 

C = {C StX : s G E,x G K}, 

where 

C s>x = {ze D(E) : z(s) < x} 
forse^iG M, then C G CLT(P). 

Remark 4. Note that a sample function of the X-process is in the set C SyX iff 
X s < x. Hence, if we identify a point (s, x) G E x ffi. with the set C SyX , then 
instead of saying C G CLT(P), we will often say 

{/ Xs < x . - Pr{X s <x):seE,xeR} 

satisfies the CLT in t^C) (or in ^(Bxl)). 

Remark 5. At this point one might guess the reader is questioning the various 
assumptions in Theorem 3. First we mention that D(E) is some convenient 
function space. For example, typically the process X has continuous sam- 
ple paths on E, so D(E) = C(E) in these situations. More perplexing, 
at least for most readers, is probably the appearance of the distributional 
transforms {F t : t G E} in the L-condition fflUj) . If the distribution func- 
tions F t are all continuous, then F t = F t ,t G E, and our proof obviously 
holds with F t replacing F t in the L condition. However, without all the 
distribution functions F t assumed continuous, the methods required in our 
proof fail with this substitution. An interesting case where the distribu- 
tional transforms are useful occurs when one has a point to G E such that 
P(X(t) = X(t ) for all t in E) = 1, and F to is possibly discontinuous. In 
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this situation, the L condition (TTUj) holds for the Gaussian process H(t) = g 
for all t G E, g a standard Gaussian random variable, and X(t ) having any 
distribution function F to . Thus Theorem 3 applies, and yields the classical 
empirical CLT when the set S is the real line and the class of sets consists of 
half-lines for all laws F to . A similar result also applies if E is a finite disjoint 
union of non-empty sets, and the process {X t : t G E} is constant on each 
of the disjoint pieces of E regardless of the distribution functions F t ,t G E. 
More importantly, however, is not only does our method of proof fail when 
the L-condition is given in terms of at least one discontinuous F t , but even 
when the empirical process is assumed to be pregaussian, the example in 
section 8.4 shows that the L-condition for the process {F S (X S ) : s G E} is 
not sufficient for the CLT of Theorem 3. One needs to assume something 
more, and our results show th e L-condition for the process {F S (X S ) : s G E} 
is sufficient. 

In section 7 we will provide another theorem showing how Theorem 3 
can be applied, and hence the examples obtained there are motivation for 
its formulation in terms of the L condition (flOj) . The following remark also 
motivates the presence of the process {F t (X s ) : s G E} and the L-condition 
in our CLT. In particular, we sketch an argument that for each t G E a 
symmetric version of this process satisfies a CLT in i^E). This remark is 
meant only for motivation, and in its presentation we are unconcerned with 
a number of details. 

Remark 6. Let 

{I Xs < x - Pr{X s <x):seE,xeR} 

satisfy the Central Limit Theorem in the closed subspace of £oo{E x K) con- 
sisting of functions whose s-sections are Borel measurable on R. We denote 
this subspace by (i^^E x E), and also assume the distribution functions F t 
are all continuous. Then, for each fixed t G E we define the bounded linear 
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operator : £oo,m(E x E) — )■ £oo(E) given by 

0(/)OO= / f{s,x)F t {dx). 



Now by the symmetrization Lemma (Lemma 2.7 in fGZ84|) we have for a 
Rademacher random variable e independent of the empirical process variables 
that {eI Cs , x : s e E,xeR} satisfy the CLT in i^E x R). Taking /(s,x) = 
7(7 S j,. , we have for all t E E fixed that 

0(/)(s) = l-F^s)-) = 1-F t (z{s)), 

as we are assuming the F t are continuous. Therefore the continuous mapping 
theorem (see, e.g., Theorem 1.3.6 in |vdVW96] ) implies that for each t G E, 



1 n 

Z n (s) = -= - ftW), s G -E, 



n 

3=1 



satisfies the CLT in £ 00 {E). In addition, since we are assuming F t = F t , we 
should then have "asymptotically small oscillations" , namely, for every S > 
there exists e > such that 

1 n 

Pr*( sup — VV(F t pq - F 4 (X t )) > 5) < 5. 

p( S ,t)<e V™ ^ 

By using standard symmetry arguments this last probability dominates 
ipr*(max sup \F t (X s ) - F t (X t )\ > sftiS) < 5, 

which (again by standard arguments) implies (modulo multiplicative con- 
stants) 

nPr*( sup \F t (X s ) - F t (X t )\ > ^5) < 5. 

p{s,t)<e 

While this is different from the hypotheses in our Theorem, it indicates that 
the quantity sup p / S) ^ <e \F t {X s ) — F t {X t )\ is relevant to any such theorem. 
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5 Preliminaries for generic chaining. 



Let T be an arbitrary countable set. Then, following Talagrand in |Tal05] 
we have: 

Definition 1. An admissible sequence is an increasing sequence (An) of 
partitions of T such that 

Card A n <N n = 2 2 \ 

The partitions (A n ) are increasing if every set in A n+ i is a subset of some 
set of A n . 

We also have 

Definition 2. If t £ T, we denote by A n (t) the unique element of A n that 
contains t. For a psuedo-metric e on T, and A C E, we write A e (A) to 
denote the diameter of A with respect to e. 

Using generic chaining and the previous definitions, Theorem 1.4.1 of 
|Tal05] includes the following: 

Theorem 4. Let {H(t) : t 6 T} be a centered Gaussian process with L 2 
distance e(s,t) on T, and assume T is countable. Then the process {H(t) : 
t G T} is uniformly continuous and sample bounded on (T, e) with probability 
one if and only if there is an admissible sequence of partitions of T such that 

lim sup V 2 n/2 A e (A n (t)) = 0. (14) 

n>r 

Under the assumption that H is centered Gaussian and uniformly contin- 
uous on (T, e), then, recalling remark 1, it follows that H is sample bounded 
on T is equivalent to (T, e) being totally bounced. Also, an immediate corol- 
lary of this result used below is as follows. 
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Proposition 1. Let Hi and H 2 be mean zero Gaussian processes with L 2 
distances e^, e 2 , respectively, on T. Furthermore, assume T is countable, and 
ei(s,i) < e 2 (s,t) for all s,t GT. Then, if 2 sample bounded and uniformly 
continuous on (T, with probability one, implies ifi is sample bounded and 
uniformly continuous on (T, ex) with probability one. 

Remark 7. One can prove this using Slepian's Lemma (see, e.g., |Fer75j ) . 
However, the immediate conclusion is that Hi is sample bounded and uni- 
formly continuous on (T, e 2 ). Then, a separate argument is needed to show 
the statement in this proposition. Using the more classical formulation for 
continuity of Gaussian processes involving majorizing measures (see, e.g., 
Theorem 12.9 of |LT91j ). the result also follows similarly to what is explained 
below. 

Proof. By the previous theorem {H 2 (t) : t G T} is sample bounded and 
uniformly continuous on (T, e 2 ) with probability one if and only if there 
exists an admissible sequence of partitions of T such that 

lim su P V2"/ 2 A e2 (A n (t)) = 0. 

Since A ei (A n (t)) < A e2 (A n (t)), we have 

lim su P V2"/ 2 A ei (A n (t)) = 0, 

r-+oo t( z T ^ 
n>r 

and hence Theorem 4 implies Hi is sample bounded and uniformly continuous 
on (T, ei) with probability one. Thus the proposition is proven. □ 

6 The Proof of Theorem 3. 

First we establish some necessary lemmas, and the section ends with the proof 
of Theorem 3. Throughout we take as given the assumptions and notation 
of that theorem. 
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6.1 Some Additional Lemmas. 

In order to simplify notation, we denote the L 2 distance on the class of 
indicator functions 

J r = {Ix 3 < x :seE,xeR} 

by writing 

r((s,x),(t,y)) = {E((I Xs < x -Ix t < y ) 2 )}K 

and identifying J 7 with E x R. Our next lemma relates the r-distance and 
the p-distance. It upgrades (fT2l) when x ^ y. 

Lemma 2. Assume that ([9]) holds. Then 

r 2 (( S , x), (t, y)) < min \F u (y) - F u (x)\ + (2L + 2)p 2 (t, s). (15) 

u£{s,t} 

Moreover, if Q denotes the rational numbers, there is a countable dense set 
Eq of (E,p) such that JF$ = {Ix 3 <x '■ (s,x) G E x Q} is dense in (J 7 , r). 

Proof. First observe that by using the symmetry in s and t of the right hand 
term of f|T5|) . we have by applying ffTTj) in the second inequality below that 

T 2 (0,X), (t,y)) = E\I Xt <y - Ix a <x\ < Wx t <y - Ix t <x\ +E|/Xi<x - Ix s <x\ 

= \F t (y) - F t (x)\ +Pi(X s <x< X t ) +Pi(X t <x< X.) 
< \F t (y)-F t (x)\ + (2 + 2L)p 2 (s,t). 
Similarly, we also have by applying ffTTj) again that 

r 2 (( S , x), (f, y)) < \F s (y) - F s (x) \ + (2 + 2L)p 2 (s, t). 

Combining these two inequalities for r, the proof of (1151) holds. Since (E, p) 
is assumed totally bounded, there is a countable dense set Eq of (E,p), and 
hence the right continuity of the distribution functions and (115H then imply 
the final statement in Lemma 2. □ 
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Using Lemma 2 and the triangle inequality we can estimate the r-diameter 
of sets as follows. 

Corollary 1. If t B e B C E and D C R , then 

diam r (5 x D) < 2{(2L + 2) 1/2 diam p (E) + sup \F tg (y) - F tB (x)\ 1/2 }. 

x,ydD 

Lemma 3. Assume that (s, x) satisfies 

r((s,x), (t,y)) = \\I x .<x - txtKyh < e, 
p{s,t) < e, and © holds. Then, for c = (2L + 2) 1 ' 2 + 1, 

\F t {x) - F t (y)\ < (ce) 2 , 

or in other words 

\F t (x) - F t (y)\ < {cm a x{r{{s,x),{t,y)),p{s,t)}) 2 . 
Proof. Using (TTTj) ) in the second inequality below we have 

Wt(y) — Ft^x)^ 2 = \\Ix t <x — Ix t <y\\2 < ||^X s <a; — ^X t <i/||2 + l|-^X s <x _ Fx t < x \\2 

< e + ^Pr(X s < x < X t ) + Pr(X t < x < X s ) 

< e + (2Le 2 + 2e 2 ) 1/2 = [(2L + 2) 1/2 + l]e = ce. 
Hence the lemma is proven. □ 

The next lemma is an important step in verifying the weak-L 2 condition 
in item (iii) of Theorem 2 above, i.e. see Theorem 4.4 of [AGOZ88J. 

Lemma 4. If ( TlPl) holds, c is as in Lemma 3, and 

x), (t, y)) = max{r((s, x), (t, y)),p(s, t)}, 
then for all (t, y) and e > 

Pr*( sup \I Xt < y - Ix s <x\ > 0) < (2c 2 + 2L + l)e 2 . 

{(s,x):X((t,y),{s,x))<e} 
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Proof. First we observe that 

Pr*( sup \Ix t <v ~ J x s <x| > 0) 

{(s,x):\((t,y),(s,x))<e} 



Pr *( SUp Ix t <y,X s >x + Ix a <x,Xt>y > 0)- 

{(s,x):\((t,y),(s,x))<e} 



Again, using the fact that x < y implies F t (x) < F t (y), we have 
Pr*( sup \I Xt < y -Ix 3 <x\ > 0) 

{(s,x):X((t,y),(s,x))<e} 



{(s, a; ):A((t,2/),(s,a : ))<e} 



{(s,x):A((t,y),(s,a;))<e} 



where 



and 



J - Pr *( SU P 7 A(*«)<A(w),F t (s)<F t (X.) > 0) 

{{s,x):\{(t,y),{s,x))<e} 



II - Pr*( SUp i > t (^)<A(x),F t ( lf )<F t (^t) > °)- 

{{s,x):\((t,y),(s,x))<e} 



At this point we use Lemma [3] to see that in (fl6|) we can use 

inf F t (x) > F t (y) - (ce) 2 . 

{(s,x):X((t,y),(s,x))<e} 
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Therefore, by again using (TTOj) 

1 - Pr *^ . W , SU P ^ /A(Jf t )<F t ( V ),F t ( W )-(a)3<F t (Jf.) > °) 
{(s,x):A((t,i/),(s,a:))<e} 



- ^^r, \ W , SU P, /A(X t )<F t ( V ),F t (i,)-(a)3<F t (JfO+ea > °) + ^ 

{(s,x):A((t,2/),(s,a;))<e} 



< Pr(F t (y) - (ce) 2 - e 2 < F t (X t ) < F t {y)) + Le 2 



<(c 2 + L + l)e 2 by p. 

Now, we estimate II in (fT7|) . Using the fact that < -Ft(x) for all x, 

Lemma 3, and our definition of L, we therefore have 

u ^ ^^ S T ^ , 1 ^{X a )<F t (x),F t (y)<F t (X t ) > °) 
{(s,a;):A((t,2/),(s,a;))<e} 



< Pr(P t (X t ) - e 2 < + (ce) 2 , F t (y) < F t (Jf t )) + Le 2 



< (c 2 + L + l)e 2 . 

□ 



6.2 The Construction and the proof of Theorem 3. 

Since (E,p) is totally bounded by Remark 1, take Eq to be any countable 
dense subset of E in the p distance. Then by Theorem 4, Talagrand's con- 
tinuity theorem, there exists an admissible sequence of partitions, B n of E 
for which 
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Fix n. Then, for each B e i3 n _i choose t B e B. Fix the distribution 
function := F tg and /ig the associated probability measure. Put a = 
(A p (B) + 2~ n ) 2 and set z 1 = sup{x e R : F B (x) < a}. We consider two 
cases. 

• Fb(zi) < a and 

• F B (z 1 ) > a. 

In the first case F B {zi) = a. If F B (zi) < a, then by right continuity there 
exist w > Z\ such that F B (w) < a, which contradicts the definition of z\. In 
this case we consider C\ = (—00, zi] and Di = 0. 

In the second case we let C\ = (—00,2:1) and D x = {z{\. In either case 
// s (Ci U D x ) > a. 

Now assume that we have constructed C±, . . . ,Ck and Di, . . . , Dk- There- 
fore we have Zk- If ^((2^,00)) > a, let Zk+i = sup{a; > Zk : F B (x) — 
F B (zk) < a}. If = 00, we set C^+i = (z k , 00) and -Dfc+i = 0- Otherwise, 
if 2fc + i < 00 there are two cases. 

Again, we have two cases. 

• F B (z k+1 ) - F B (z k ) < a and 

• F B (z k+1 ) - F B (z k ) > a. 

In the first case we consider C k+ \ = (z k , z k+ i\ and D k+ \ = 0. In the second 

case we let C k+1 = (z k ,z k+1 ) and D k+1 = {z k+1 }. As before, fJ, B (C k+1 U 

Dk+i) > ot. Hence, there can be at most — steps before {C k) D k \ k cover R. 

a 

2 

Therefore, after eliminating any empty set, we have a cover with at most — 

a 

sets. By our choice of a the cover has at most 2 2ra+1 sets. Hence since we 
have B e <£? n _i, the number of sets used to cover E$ x R is less than or equal 
to 2 2 " 1 2 2n+1 . The reader should note that the points {z k } depend on the 
set B, but we have suppressed that to simplify notation. We now check the 
r-diameters of the non-empty B x C k and B x D k . 



20 



Estimating these diameters by doubling the radius of the sets, the triangle 
inequality allows us to upper bound their radius using one of s and t to be 
t B . Also note that in Lemma 2, or Corollary 1, the term which contains 
\F tB (y) — F tB (x)\ would cause trouble in the case D k ^ 0, since this is only 
known to be > a. Luckily it doesn't appear when D k ^ 0. 

First we consider the r-diameter of sets of the form B xC k when D k = 0. 
Then C k = (z k _ 1:B , z kjB \. Hence for (s, x), (t, y) E B x C k , Corollary 1 implies 

A T {B x (zBt-u z B , k \) < 2 i^2L + 2) 1 ' 2 \{B) + A P (B) + ^ 

When D k ^ 0, then C k = (z k -i,B, z ktB ), so again by Corollary 1 the r- 
diameter of B x C k has an upper bound as in the previous case. 

If D k 7^ 0, then the only element of D k is z k:B and by Corollary 1 we have 

A T (B x D k ) < 2(2L + 2)^diam p ( J B). 

So, in either case 

A T (B x C B , k or D Btk ) < 2 [{2L + 2fl 2 A p {B) + A P (B) + . (19) 

Lemma 5. Let Q n be a sequence of partitions of an arbitrary parameter set 
T with pseudo metric e on T satisfying both 

1. Card(£ n ) < 2 2 " and 

2. lim r ^ 00 sup, eT E„> r 2 n/2 A e (G'„(t)) = 0, 

and set % n := P(Ukkh-i^)' where V(V) denotes the minimal partition 
generated by the sets in V. Then the sequence H n (Notice the n — 1 in 
the union) also satisfies those conditions. 

Proof. The first condition holds since a simple induction on n implies the 
minimal partition 

"Hn = 'P{^l<k<n-lGk) 
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has cardinality at most YYk=i ^ 2 — • The second condition holds since 
the partitions are increasing collections of sets, and hence diam e (if n (t)) < 
diam e (G n _i(t)). □ 

Lemma 6. Let E$ be a countable dense subset of (E,p). Then there exists 
an admissible sequence of partitions {A n : n > 0} of Eq x M such that 

lim sup V2"/ 2 A T (G n ((t,y))) = 0. (20) 

Proof. We construct the admissible sequence of partitions A n as above. More 
precisely, let {£>„ : n > 0} be an increasing sequence of partitions of E Q such 
that ffTSl) holds, and after the construction above we also have (TTTJj) . That is, 
for k > 1 let 

G k = {BxF:B € B fc _i, F G £ B } 

where 

£b = ah sets nonempty} 

and Cj,b, -Dj,s are constructed from B e -B^-i as above. Then, for n > 3 set 

•An = V(U2<k<n-lGk), 

where V{T>) is the minimal partition generated by the sets in V, and for 
n — 1, 2 we take *4. n to be the single set Eq x KL Since the cardinality of the 
partitions Qk defined above is less than or equal to 2 2k 1 2 2k+1 , then a simple 
induction on n implies the minimal partition 

A n = V(U 2 <k<n-lQk) 

has cardinality at most Yl^Zl 2 2k ' 1 2 2k+1 < 2 2 " . By §19§ and Lemma 5 we 
have 

sup V 2^ 2 A T (A n (t, y)) < C{sup V 2 n / 2 \{B n {t)) + V 2"/ 2 2-"}- 
(t,v) * 

v ,y > n>r n>r n>r 

Thus (ITS]) implies r satisfies (I2"0"j) with respect to the sequence of admissible 
partitions *4. n on Eoxl. □ 



22 



Proof of Theorem^ Let Q denote the rational numbers. Then, if we restrict 
the partitions A n of E x R in Lemma 6 to E x Q, we immediately have 

lim sup V2"/ 2 A T (G n ((t,y))) = 0, (21) 

and (i?o x <5,t) is totally bounded. Now let {G( SjX ) : (s,x) £ E x R} be a 
centered Gaussian process with E(G( StX )G(t, y )) = P(X S < x, X t < y). Then, 
G has L2 distance r, and by fl2Tl) and Theorem 4, it is uniformly continuous 
on (E x Q,t). Hence if {H^^ : (s,x) G E x Q} is a centered Gaussian 
process with 

E{H {s>x) H m ) = P{X S <x,X t <y)- P(X S < x)P{X t < y), 

then 

E{(H (S>X) - H {t , y) f) = r 2 ((s,x), (t,y)) - {P(X S < x) - P(X t < y)f). 

Hence the L 2 distance of H is smaller than that of G, and therefore Propo- 
sition 1 implies the process H is uniformly continuous on (E x Q,dn)- By 
Lemma 2 the set Eq x Q is dense in (E x Wt, r), and since 

d H ((s, x), (t, y)) < t((s, x), (t, y)) 

we also have Eq x Q is dense in (E x M, dn)- Thus the Gaussian process 
{H(s,x) '■ (s, x) G E X R} has a uniformly continuous version, which we also 
denote by H, and since (i?, d#) is totally bounded the sample functions are 
bounded on E with probability one. 

If J 7 = {Ix s <x '■ (s,x) e E x R)}, then since 

d H ((s, x), (t, y)) = pp(I Xs <x, Ix t <y), 

the continuity of H on (E x R, g^) implies condition (ii) in Theorem [2] is 
satisfied. Since Ix t <y is bounded, condition (i) in Theorem [2] is also satisfied. 
Therefore, Theorem 3 follows once we verify condition (iii) of Theorem [3 
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To verify (iii) we use Lemma 4. As before, we identify the function 
/ = Ix a <x £ J 7 with the point (s,x) G E x R. Hence, for the centered 
Gaussian process 

{Gf.fe J 7 } 

in (iii) of Theorem [21 for (s, x) G -E x R, we take the process 

In our definition of G we are assuming: 

(a) {H s : s G -E} is a Gaussian process whose law is that of the process 
{H s : s G £7} given in the theorem, and independent of everything in our 
empirical model, and 

(b) : (s, x) G -ExR} is a uniformly continuous and sample bounded 
version of the Gaussian process, also denoted by G( s>x \, but defined above on 
E x Q. The extension to all of E x R again follows by the fact that E x Q 
is dense in (E x R, r). 

Therefore, G is sample bounded and uniformly continuous onExM with 
respect to its L 2 distance 

d 6 ((s, x), (t, y)) = {r 2 ((s, x), (t, y)) + p 2 (s, t)} 1 ^. 

Condition (iii) of Theorem |2] now follows easily from Lemma 4 since for (t, y) 
fixed, 

{(s,x) : \({s,x),(t,y)) < e/2} C {{s,x) : d d ((s,x),(t,y)) < e}, 
and for a random variable Z bounded by one we have 

supt 2 P(\Z\ > t) < P(\Z\ > 0). □ 

7 Another Theorem and some Examples. 

Let {X t : t G E} be a sample continuous process such that 
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(I) sup te£ \Ft(x) — F t (y)\ < k\x — y\@ for all x, y G R, ft < oo, and some 
/9e (0,1]. 

(II) |X 4 — X s | < r0(s, t) for all s,t E E, and for some rj > and all £ > a; 

P(r > x) < x~ v . 

(III) For a G (0, q;o) we have (0(s,£)) a < p a (s,t), s,t G -E, where p a (s,t) 
is the L 2 distance of a sample bounded, uniformly continuous, centered Gaus- 
sian process on (E,p a ), which we denote by {H a (t) : t G E}. 

Theorem 5. Let {X t : t G E} be a sample continuous process satisfying 
(I-III) above. Then 

{Ix s < x ~ Pr(X s <x) : s e E,x eR} 

satisfies the Central Limit Theorem in C°°(E x R). This CLT also holds 
under (I-III) provided (II) is strengthened to hold for all rj > and x > x v , 
and (III) holds for some a G (0,2/(3). 

Remark 8. If the process {X t : t G E} in (III) is a Gaussian process, then the 
CLT of Theorem 5 holds provided (I) is satisfied, (II) is such that \X t — X s \ < 
T0(s, t) for all s,t G E and T < oo, and (III) holds only for some a G (0, 2//3). 
This follows from the final conclusion of the theorem, and the Fernique- 
Landau-Shepp result, which implies exponential estimates for the probability 
in (II). 

Proof. The theorem follows by taking {H(t) : t G E} in Theorem 3 to be 
a suitably chosen {H a (t) : t G E} as in (III), and then verifying condition 
(flUj) . Since (I) implies the distribution functions F t are all continuous, the 
distributional transforms in fflQj) are simply the distributions themselves. 
Therefore, applying (I), with a G (0, «o) to be chosen later, we have 

Pr*( sup \F t (X s ) - F t (X t )\ > e 2 ) < Pr*( sup \X S - X t \ > {^). (22) 
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Hence (II) implies 

2 

Pr*( sup \F t (X s )-F t (X t )\>e 2 )<P(T>C T )H sup <f>(s , t))' 1 )) , 

Pa(s,t)<e K Pa(s,t)<e 

(23) 

and since 

( sup <f){s,t)y 1 >( sup p a (s,t))~£ > e^, 

Pa(s,t)<e Pa(s,t)<e 

(III) therefore implies 

Pr*( sup \F t (X s ) - F t (X t )\ > e 2 ) < e^e« < kh 2 , (24) 

p a (s,t)<e 

provided a > is chosen sufficiently small so that rj(^~ f ) > 2 and < e < e 

-1 2 __1 

is sufficiently small to imply A; e* 9 Q > x . To obtain the final conclusion of 
the theorem take a G (0,2/(3) and i] sufficiently large that r)(l/a — 2/(3) > 2. 

-1 2_ J_ 

Then, for < e < e,, sufficiently small that k f « > we again have 
(24). □ 

Corollary 2. Let {F t : t G [0,T]} be a sample continuous process 7- 
fractional Brownian motion for < 7 < 1 such that Y = with probability 
one, and set X t — Y t + Z, where Z is independent of {Y t : t G [0, T]} and has 
a bounded density function. Then, 

Ux s <* - Pr(X s <x):se[0,T\,xeR} 

satisfies the Central Limit Theorem in £°°([0,T] x M). 

Remark 9. The addition of the random variable Z in the previous corollary 
suffices to make condition (I) hold, and is not used in any other way. Fur- 
thermore, below we will see that something of this sort is necessary, since 
we will show that the CLT of the previous corollary fails for the fractional 
Brownian motion process Y, itself. 
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Proof. The L 2 distance for {X t : t G [0, T]} is given by 

E((X S — X t ) 2 )^ = c 7 |s — t| 7 , s,t 6 [0, T], (25) 

and without loss of generality we may assume the process to be normalized 
so that — 1. Furthermore, it is well known that these processes are Holder 
continuous on [0, T], i.e. for every 9 < 7/2 we have 

|*t-*«| <r|t-s| e ,M G [0,T], (26) 

where 

£(exp{ C r 2 }) < cx), (27) 

for some c > 0. That T has exponential moments is due to the Fernique- 
Landau-Shepp Theorem, and hence conditions (I-III) hold provided we take 
the Gaussian process H a of condition (III) to be an a.9 fractional Brownian 
motion for any 9 < 7/2. Hence the corollary follows from Theorem 5. □ 

Corollary 3. Let / = [0, T] and {Y^ s ^ : (s, t) G 1x1} be a sample continuous 
Brownian sheet such that 3^o,t) = ^?s,o) = 0,0 < s,t < T with probability 
one, and set = Y^ s ^+Z, where Z is independent of {Y( s>t ) '■ (s, t) G 1x1} 
and has a bounded density function. Then, 

{ J x {s , t) <x ~ Pr(X (M) < x) : (s,t) G I x J),x G M} 

satisfies the Central Limit Theorem in C°°((I x J) x R). 

Proof. First of all observe that since Z has a bounded density, and is inde- 
pendent of the Brownian sheet F, we have (I) holding. Furthermore, from 
Theorem 1 in the paper |Yeh60] . these processes are Holder continuous on 
I x I, i.e. for (s,t), (u,v) G I x / and < 7 < 1/2 we have 

\X {S)t) - X M \ < T((^) 2 + C^f)'i. (28) 
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where 

£(exp{ C r 2 }) < oo, (29) 

for some c > 0. That T has exponential moments is due to the Fernique- 
Landau-Shepp Theorem, and hence conditions (I-III) hold provided we take 
the Gaussian process H? ^ of condition (III) to be 

H^ t) = Y s + Z t + Z,(s,t)eIxI, (30) 

where the processes {Y s : s <E 1} and {Z t : t G /} are independent ^-fractional 
Brownian motions, independent of the normal zero-one random variable Z. 
To determine 6 we set 

m) = ((^) 2 +(^m 

where < 7 < 1/2. Then, 

0(M), («,«)) < (i^i + i^ir < ii^r + i^n 

Normalizing the Y s and Z t processes suitably, we have 

Hence for < a < 1 , 7 = 1/4, and < a/8 we have 

a ((s, t), (u, v)) < p a ((s, t), (u, v)) 
Hence the corollary follows from Theorem 5. □ 

8 Examples where our CLT fails. 

8.1 Fractional Brownian Motions. 

Since the class of sets in our CLT arises using the Vapnik-Cervonenkis class 
of half lines, one might think that perhaps if i.i.d copies of the process {X t : 
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t G E} satisfied the CLT in C(E), then the class of sets C of Theorem 3 
would satisfy the CLT(P). Our first example shows this fails, even if the 
process X t is Brownian motion on [0, 1] tied down at t = 0. In this example 
the process fails condition (I) in Theorem 5 since P(X Q = 0) = 1. To prove 
this we show the necessary condition for C to satisfy the CLT(P) appearing 
in (ii-b) of Theorem 1 fails. More precisely, since measurability is an issue 
here, the next lemma shows that there is a countable subclass Cq of sets in C 
such that by Theorem 3 of jTalgg] . C Q (CLT{P). Thus C fails the CLT(P), 
as otherwise all subclasses also are in CLT(P). 

Lemma 7. Let C = {C tjX : < t < 1, oo < x < oo}, where C tjX = {z G 
C[0, 1] : z(t) < x}, and assume {X t : t G [0, 1]} is a sample continuous Brow- 
nian motion tied down at zero. Also, let Cq denote the countable subclass of 
C given by Cq = {C t>y G C : t,y G Q}, where Q denotes the rational numbers. 
Then, for each integer n > 1, with probability one 

A Cq (-E?i, • • ■ , B n ) = 2 n , 

where A C< ^(B 1 , ■ ■ ■ ,B n ) = card{C n {B 1; ■ ■ • ,B n } : C G C Q }, and B ir -- ,B n 
are independent Brownian motions on [0, 1] starting at zero. 

Proof. Suppose one wants the k functions {Bj 1 ,--- ,Bj k } with probability 
one, where 1 < j± < < • • • < jk < n. Let 

u = (ui, ••-,«„) 

where Uj x = Uj 2 = ■■■ = Uj k = 1 and all other Uj = 2. Then ||u|| = 
(E"=i u }) 1 ' 2 = ( 4n - 3A;) 1/2 . Now set v = (v h ■ ■ ■ , v n ), where 

1 

Vjl -"*> = ••• = v h = 2 (4n - 3A;)V2 
and all other Vj = (in _l k) i/2 ■ Then v = u/(2||u||) and ||v|| = 1/2. 
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Let 

(frQO, •••,£,»(*)) 
lJ (2sLL(l/s))V2 

for < s < 1. Then the LIL implies with probability one that 

liminf ||v- W(s)\\ = 0, 

and hence with probability one there are infinitely many rational numbers 
1 1 such that 

C t , x (t) n {Bi, • • • , = {B jl} • • • , .B^}, 

where x(t) e Q for t e Q and 

_ 3(2tLL(l/t))V 2 (2tLL(l/t))V 2 
I W 4(4n - 3£;) 1/2 1 ^ lQ(4n - 3k) 1 / 2 ' 

Since and the set {ji, • • • , jk} were arbitrary, and with probability one we 
can pick out the subset {Bj 1 , ■ ■ • , Bj k }, the lemma follows as the intersection 
of 2 n subsets of probability one has probability one. □ 

The failure of the CLT also holds for all sample continuous fractional 
Brownian motions {Xn{t) : t £ [0, 1]} which are tied down at zero. The proof 
of this again depends on the law of the iterated logarithm for n independent 
copies of this process at t — 0, which then allows us to prove an analogue of 
the previous lemma. The LIL result at t — for a single copy follows, for 
example, by Theorem 4.1 of |GK91j . and then one can extend that result to 
n independent copies by classical proofs as in |Kue76j . The details of this 
last step are lengthy, but at this stage are more or less routine in the subject, 
and hence are omitted. Of course, the CLT for i.i.d. copies of these processes 
is obvious as they are Gaussian. 
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8.2 A uniform CLT Example. 

In the previous examples, when the distribution function F t of X t jumped, 
the oscillation of the processes at that point caused a failure of our CLT. 
Hence one other possible idea is that if the process {X t : t G [0, 1]} is Lip-1 
on [0, 1], then our CLT might hold. For example, this is true for the Lip-1 
process X t = tU,t G [0, 1], where U is uniform on [0, 1]. Moreover, in this 
example the densities of F t still are unbounded near t = 0. 

To see this let X t j,j — 1, • • • , n, t G [0, 1] be i.i.d. copies of X t = tU, t G 
[0,1], and define 

1 n 
V j=i 

where C = {C %y : t G [0,1], y G E} and C t>v = {z G C[0, 1] : z(t) < y}. 
Therefore, W n (Ct )y ) = for all y G R when t — 0, and also when y/t > 1. 
Moreover, if we define Q = {(— oo,r] : < r < 1}, and 

(j){Ic t>y ) = /(-oo,i] if y/t > 1, < t < 1, or y = and t = 0, 

<!>{Ict,y) = I(po,o] if y/t < but not y = and t = 0, 

and 

<f>(I Ct J = I(-oo, y /t] if < y/t < 1, < y < 1, < y < 1. 

Then 0(C) = {Iu<r :0<r<l} = ^ and (f) maps L 2 equivalence classes of C 
onto Q with respect to the law of {X t : t G [0, 1]} for sets in C, and the law 
of U for sets in Q. Now Q satisfies the CLT(C(U)) by the classical empirical 
CLT, i.e. see Theorem 16.4 of |Bil68] . and since <fi preserves covariances 
we thus have W n {C t>y ) converges weakly to the Gaussian centered process 
W(C t , y ) = Y((j)(Ct,y)) on C, where K((-oo,s])) = B(s) - sB(l) is the tied 
down Wiener process on [0, 1], i.e. B(-) is a Brownian motion. 
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8.3 A Lip-1 Example without the CLT. 

In this example we see that the Lip-1 property for {X t : t £ [0, 1]} is not 
always sufficient for our CLT. Here X = 0, and for < t < 1 we define 

oo 

X t = t^2( aj (t) + 2)I E .(t)U, 
i=i 

where 

(i) Ej = (2-J',2-^'- 1 )] for j > 1. 

(ii) {(Xj(t) : j > 1} are independent random processes with (Xj(-) defined 
on Ej such that for j > 1 

P(t*j(t) = sin(27r2 j t),t £ = 1/2, 

and 

P(aj(t) = sm(2n2 j+1 t),t £ Ej) = 1/2. 

(iii) U is a uniform random variable on [3/2, 2], independent of the {a,-}. 

Since the a/s are zero at endpoints of the Ej and we have set X(0) = 0, 
it is easy to see X(t) has continuous paths on [0, 1]. Moreover, X(t) is Lip-1 
on [0, 1] and X(t) has a density for each t £ (0, 1], but our CLT fails. 

The failure of the empirical CLT can be shown by verifying a lemma of 
the sort we have above for Brownian motion, and again we see the lack of 
uniformly bounded densities is a determining factor. 

For each integer n > 1, let Xi, ■ ■ ■ ,X n be independent copies of X, and 
again take C = {C t , x '■ < t < 1, oo < x < oo}, where C tiX = {z £ C[0, 1] : 
z(£) < x}. Also, define Cq as in Lemma 7. Then, we have the following 
lemma, and combined with the argument in section 8.1 we see the empirical 
CLT fails when this X(-) is used. 
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Lemma 8. For each integer n > 1, with probability one 

A c «(X 1 ,---,X ri ) = 2", 

where A C Q(X U ■ ■ ■ ,X n ) = card{C n {X 1? • • • , X n } : C G C Q }. 

Proof. Suppose one wants the k functions {X^, • • • , X ik } with probabil- 
ity one, where 1 < i\ < %2 < • • • < %k < n. If we write 

oo 

x i (t) = tJ2KA t ) + 2 ) I E j ( t ) u i 

3=1 

where the {a it j : j > 1} are independent copies of {(Xj(t) : j > 1} and 
{Ui : i > 1} are independent copies of U, independent of all the cc^'s, then 
this can be arranged by taking 

Oiij(t) = sin(27r2 J+1 t),i e {ij, • • • ,i fc }, 

and 

aij(t) = sin(27r2 j t), i e {1, • • • , n} n {ii, • • • , i fc } c , 

provided we set t = tj = 2~ j + \(2^-^ - 2~ j ). The probability of this 
configuration on the interval Ej is l/2 n and hence with probability one the 
Borel-Cantelli lemma implies there are infinitely many (random) {tj J, 0} 
such that 

a 'h,j(tj) = ■ ■ ■ = CHi k: j(tj) = 0, 

and 

a i,j\tj) = 1 

for all other i e {1, • • • ,n}. Thus with probability one there are infinitely 
many rational numbers t \, such that 

Ct,x(t) n " ' ' x n } = {x h , • • • , X ik }, 
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provided 

, , 17* 
x = x(t) = ——. 

4 

Of course, x(t) is then also in Q, and since k and the set ■ ■ ■ , i^} were 
arbitrary, and we can pick out {X^ , • • • , Xi k } with probability one using sets 
in Cq, the lemma follows as the intersection of 2 n subsets of probability one 
has probability one. 

To see X{t) is Lip-1 on [0, 1], observe that since the intervals {Ej : j > 1} 
are disjoint, X(t) is differentiable on there interiors, and an easy computation 
implies 

sup sup |A"'(f)| < [4tt + 3]C/. 

j>l 2-3<t<2-0- 1 ) 

Furthermore, X(t) is continuous on [0,1], so we can apply the mean value 
theorem on each of these intervals. 

If0<s<t<lwe either have s, t e Ej for some j > 1 or s G Ej and 
t G E k for j > k. If < s < t < 1, then since t(aj(t) + 2)U is differentiable 
on the interior of Ej and continuous on Ej the mean value theorem implies 

\X(t)-X(s)\ < (Air + 3)U\t-s\. 

If s G Ej and t <E Ek with j > k, then by applying the mean value 
theorem in each interval we obtain 

\X(t)-X(s)\<\X(s)-X(2-^ i y)\ 
j-k-i 

+ l^(2~ 0_1) ) - X(2-(^ 1 ^)| + \X(2' k ) - X(t)\ 

r=l 

< (An + 3)U\t- s\. 

If s = 0, then since X(0) = 0, the proof is immediate. Hence X(t) is Lip-1 as 
claimed, and the Lipschits constant is bounded by (4tt + 3)U with probability 
one. Furthermore, since U is uniform on [3/2, 2] and independent of the {a,}, 
then X(t) has a density for each t G (0, 1]. 
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8.4 Variations of the L-condition and the CLT. 

Here we produce examples where the sets C, or more precisely the class 
of indicator functions given by C, are P-pregaussian, and yet C^CLT(P). 
More importantly, they also satisfy the modified L-condition, i.e. we say 
{X s : s G E} satisfies the modified L-condition if for alH G E and e > 

Pr*( sup \F t {X s ) - F t (X t )\ > e 2 ) < Le 2 . (31) 

p(s,t)<e 

Hence these examples also provide motivation for the use of the distributional 
transforms in the L-condition of ( TTUj) used in Theorem 3. 

Notation for the examples in this subsection is as follows. Let E = 
{1, 2, 3, • • • } and assume D(E) = {z : z{t) = or 1, t G E} with C = {C t , y : 
t G E,y G R}, where C tjV = {z G D(E) : z(t) < y}. Then, since the 
functions in D(E) take only the values zero and one, we have 

c = c uc u 

where C a = {C %a ■ t G E} and for t G E, a = 0, 1, C t , a = {z G D(E) : z{t) = 
a}. Also, let £ denote the minimal sigma- field of subsets of D(E) containing 
C, and let P denote the probability on (D(E), S) such that P{C t ,i) = Pt and 
the events {C t ,\ : t G E} are independent events, i.e. P is a product measure 
on the coordinate spaces of D(E) = {0, 1} N with the t th coordinate of D(E) 
having the two point probability that puts mass pt on one, and 1 — pt on 
zero. 

Proposition 2. Let C be defined as above. Then, 

(i) C is P-pregaussian whenever p t = o((log(t + 2))" 1 ) as t — > oo. 

(ii) C G CLT(P) if and only if for some r > 

oo 
i=l 
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(iii) If p t = (log(t + 2))~ 2 , and {H(t) : t G P} consists of centered 
independent Gaussian random variables with E(H(t) 2 ) = (log(t + 2))~i, 
then {X s : s G E} satisfies the modified L-condition, C is P-pregaussian, 
and C^CLT. In particular, in view of Theorem 3, it does not satisfy the 
L-condition. 

Proof. Since the events of Co are complements of those in Ci, it is easy to see 
that C G CLT(P) if and only if d G CLT(P), and since Theorem 3.8.1 of 
|Dud9 9j implies finite unions of classes of sets that satisfy the CLT(P) are 
in CLT(P), we have C G CLT(P) if and only if d G CLT(P). Therefore, 
since the events of C\ are independent, Theorem 3.9.1 in |Dud99] . p. 122, 
implies that C\ G CLT(P) if and only if for some r > 

oo 

X>(i-p t )r<oc. 

Hence (ii) holds. 

Now the centered Gaussian process {Gp(C) : C G C\} = {Gp{C t ,i) : t G 
E}, and since the random variables {Gp(Ct,i) : t G E} are mean zero and 
E(Gp(C t) i) 2 ) = Pt(l — Pt), we have C\ is P-pregaussian provided 

p t = o((log(t + 2)) _1 ) as t ^ oo. 

Similarly, taking complements we have{Gp(C) : C G C } = {Gp(C t) o) : t G 
P} is also P-pregaussian under this condition for p t , and hence C = Cq U Ci 
is P-pregaussian whenever p t = o((log(t + 2))" 1 ) as t — > oo. Hence (i) holds. 

To verify (iii) we take p t = (log(t + 2))" 2 , and {H(t) : t G P} to be 
centered independent Gaussian random variables with E(H(t) 2 ) = (log(t + 
2))"i. If p 2 {s,t) = P((P(s) - H{t)) 2 ), then, for s^t, 

p 2 (s,t) = (log(s+2))-f + (log(t+2))-i > max{(log(s+2))-f,(log(t + 2))-f}. 

In addition, we have \F t (X s ) - F t (X t )\ = if X s = X t = or X s = X t = 1, 
and |P t (X s ) - F t (X t )\ = p t if X t ^ X s . 
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Therefore, for all t G E fixed and e > we have 

Pr*( sup \F t (X s ) - F t (X t )\ > e 2 ) = if p t < e 2 , 

{s:p(s,t)<e} 

and 

Pr*( sup \F t (X s ) - F t (X t )\ > e 2 ) < Pr*( sup I Xt ^x s > 0) for p t > e 2 . 

{s:p(s,t)<e} {s:p(s,t)<e} 

Of course, E countable makes the outer probabilities in the above ordinary 
probabilities, but for simplicity we retained the outer probability notation 
used in (jUj) and (jTUJ). Now p(s,t) < e implies 

max{(log(s + 2))~t, (log(t + 2))~i} < e. 

Thus if p t = (log(t + 2))~ 2 > e 2 we have {sup {s:p(Sjt) < £} l Xt +x s > 0} = 0. 
Combining the above we have for each fixed t £ E and e > that 

Pr*( sup \F t (X s )-F t (X t )\>e 2 )=0, 

{s:p(s,t)<e} 

and hence the modified L-condition for {X s : s G E} holds. Thus (iii) follows. 

□ 

9 Appendix on Talagrand's Continuity Re- 
sult for Gaussian Processes. 

The following result appears in [Tal05] without proof. There is a curious 
wording at the end of the first sentence in its statement there, which after 
closer inspection suggests that cutting and pasting led to something being 
omitted. Hence we include a proof, where we have added the necessary 
assumption of total boundedness of the parameter space to the statement of 
the theorem. 
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Theorem 6. Let {X t : t G T} be a centered Gaussian process with L 2 
distance d(s,t),s,t e T, where T is countable and (T, d) is totally bounded. 
Then, the following are equivalent: 

(i) X t is uniformly continuous on (T, d) with probability one. 

(ii) We have 

lim£( sup (X s -X t )) = 0. 

d(s,t)<e 

(iii) There exists an admissible sequence of partitions of T such that 

lim supV 2 n ' 2 k(A n (t)) = 0. 

Proof. First we will show (i) and (ii) are equivalent. If (ii) holds, then by 
Fatou's lemma we have 

= lim E{ sup \X S -X t \)> ^(liminf sup \X S -X t \). 

n ^°° d{s,t)<l/n d(s,t)<l/n 

Thus, with probability one 

liminf sup \X S — X t \ = 0, 

n ^°° d(s,t)<l/n 

and since the random variables sup d(s t)<i/n \X S — X t \ decrease as n increases, 
this implies with probability one 

lim sup \X S — X t \ =0, 

n ^°° d(s,t)<l/n 

which implies (i). 

If we assume (i), then since (T, dx) is assumed totally bounded, we have 

Z = sup \X t \ < oo 
teT 

with probability one, and the Fernique-Landau-Shepp Theorem implies Z is 
integrable. Since 

sup \X S -X t \ < 2Z, 

d(s,t)<e 
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and (i) implies 

lim sup \X S — Xt\ =0 

d(s,t)<e 

with probability one, the dominated convergence theorem implies (ii). Thus 
(i) and (ii) are equivalent. 

Now we assume (i) and (ii), and choose Sk I such that 

supE sup X t < supE sup (X t — X s ) < E sup (X t - X s ) < 2~ k . 

s {t:d(t,s)<e k } a {t:d(t,s)<e k } d(s,t)<e k 

Since (T,d) is totally bounded, and (i) holds, by Theorem 2.1.1 of |Tal05] 
there exists an admissible sequence of partitions {A n : n > 0} of (T, d) such 
that for a universal constant L we have 

-J- sup V2"/ 2 A(A n (t)) < E(supX 4 ). 

teT teT 

n>k 

Now choose {n^ : k > 1} to be a strictly increasing sequence of integers 
such that ni > 4 and 

2£ 2 < J < fc 1 A? < n k _ lm (32) 

Based on the n^'s we define an increasing sequence of partitions, B n . For 
< n < ni we let i3 n = yl n . For ni < n < n<i we proceed as follows. 

First we choose a maximal set {si, . . . , SAr( £2 )} of (T, d) for which c?(sj, Sj) > 
62- By Sudakov's inequality we know that Nfa) < 2 1 / e 2. To define the par- 
titions for ni < n < n 2 we next consider the partition of T formed by the 
sets 

Cj = B( Sj , e 2 ) n (U{Z\B(s k , e 2 )) c , 1 < j < N(e 2 ), (33) 

and the sets B(s,e) are e balls centered at s. Then by Theorem 2.1.1 of 
[Tal05] for every integer 1 < j < A^(e 2 ) there exists an admissible sequence 
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of partitions for (Cj, d), which we denote by Bn\,n, such that 

2" 2 >Esupl,>i snp Y. 2n ' 2 ^ B nun(t)) 

{tec,} 2L {teCj } n > 

>^ snp J2 2"/ 2 A(A ni (t)n^ jn (t)). 

i teC j} ni <n<n 2 

Since the sets Cj form a partition of T, if B n ^ n is one of the sets, Bn 1>n (t), 
then 

2- 2 >-Unp 2 n/2 HA ni (t)nB nun (t)), 

ni<n<n,2 

and we define the increasing sequence of partitions B nitn to be all sets of 
the form A ni (t) fl £>„ ljTt (t), where i G T and B ni ^ n G Bni,n(t) for some j G 
[1, N(e 2 )]- Furthermore, since the C/s are disjoint, for rii < n < n 2 we have 

Card(B ni , n ) < 2 2ni 2 2 "N(e 2 ) < 2 2 " +1 2 1 ^ < 2 2 " +1 n 1 < 2 2 " +1 2 2 " = 2 2 " +2 , 

(34) 

and for n x < n < n 2 we define B n = B nitTl . 

Iterating what we have done for rii < n < n 2 , we have increasing parti- 
tions B nk _ un , n k -i <n <n k , for which 

(2L)2- fe >sup^2"/ 2 A(5 nfc _ lin ) 

* n>0 

> sup Yl 2n ' 2 ^( B n k -un{t) n B nk ^ nk _ x {t) • • • n B niiH2 (t) n A ni (t)), 

n fc _i<n<ra fe 
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and for rik-i < n < we define B n = B nk _ un . Therefore, we now have an 
increasing sequence of partitions {£>„ : n > 0} such that 

(2L) ^2- fc > ^ sup £ 2"/ 2 A(5 nfc _ lin (t) n {n)zlB nj _ un] {t)) n 

fc>r fc>r n k _ 1 <n<n k 

> su P ^ £ 2"/ 2 A( J B„ fc _ 1 ,„(t) n (n;: 2 1 B nj , 1 ,„ i (t))nA 11 (t)) ) 

fc>r n k _i<n<n k 

and, letting denote the generic set of <B n containing t, we have 

(2L)^2- fe >sup^ 2 " /2 A(5„(t)). (35) 

k>r k>r n k _ 1 <n<n k 

Now to count the number of elements in each partition. Since (1) holds, the 
partitions Bn k _ un are assumed admissible, and the C/s used at the subse- 
quent iterations are always disjoint, we have for rik-i < n <rik that 

Card(£„ fc _ lin ) < 2^[^2^N{e 3 )]2^N{e k ) < < 2 2 " +2 . 

Given the increasing sequence of partitions {B n : n > 0}, we now define 
the partitions A n to be the single set T for n = 0, 1 and «4.„ = £>„_ 2 for n > 2. 
Since we have Card(£> n ) < 2 2 ™ +2 for n > we thus have that the „4 n 's are 
admissible and using (4) above they satisfy (iii). □ 
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